Semantic Motion Concept Retrieval in Non-Static Background Utilizing Spatial-Temporal Visual Information
نویسندگان
چکیده
Motion concepts mean those concepts containing motion information such as racing car and dancing. In order to achieve high retrieval accuracy comparing with those static concepts such as car or person in semantic retrieval tasks, the temporal information has to be considered. Additionally, if a video sequence is captured by an amateur using a hand-held camera containing signi ̄cant camera motion, the complexities of the uncontrolled backgrounds would aggravate the di±culty of motion concept retrieval. Therefore, the retrieval of semantic concepts containing motion in non-static background is regarded as one of the most challenging tasks in multimedia semantic analysis and video retrieval. To address such a challenge, this paper proposes a motion concept retrieval framework including a motion region detection model and a concept retrieval model that integrates the spatial and temporal information in video sequences. The motion region detection model uses a new integral density method (adopted from the idea of integral images) to quickly identify the motion regions in an unsupervised way. Specially, key information locations on video frames are ̄rst obtained as maxima and minima of the result of Di®erence of Gaussian (DoG) function. Then a motion map of adjacent frames is generated from the diversity of the outcomes from the Simultaneous Partition and Class Parameter Estimation (SPCPE) framework. The usage of the motion map is to ̄lter key information locations into key motion locations (KMLs) that imply the regions containing motion. The motion map can also indicate the motion direction which guides the proposed \integral density" approach to locate the motion regions quickly and accurately. Based on the motion region detection model, moving object-level information is extracted for semantic retrieval. In the proposed conceptual retrieval model, temporally semantic consistency among the consecutive shots is analyzed and presented into a conditional probability model, which is then used to re-rank the similarity scores to improve the ̄nal retrieval results. The results of our proposed novel motion concept retrieval framework are not only illustrated visually demonstrating its robustness in non-static background, but also veri ̄ed by the promising experimental results demonstrating that the concept retrieval performance can be improved by integrating the spatial and temporal visual information.
منابع مشابه
Developing a BIM-based Spatial Ontology for Semantic Querying of 3D Property Information
With the growing dominance of complex and multi-level urban structures, current cadastral systems, which are often developed based on 2D representations, are not capable of providing unambiguous spatial information about urban properties. Therefore, the concept of 3D cadastre is proposed to support 3D digital representation of land and properties and facilitate the communication of legal owners...
متن کاملUser’s Visual Intention Driven Retrieval Model for Semantic Video Search and Retrieval
There are various models of retrieval mechanisms for effective video retrieval, however, ‘semantic gap’ still persists. This study aims to reduce the gap by providing a graphical user interface that allows user to express the visuals in their mind into a query using semantic association of spatial and temporal visual concepts. To accomplish this, we have developed a video content depiction sche...
متن کاملRecognition of Visual Events using Spatio-Temporal Information of the Video Signal
Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...
متن کاملSpatio-temporal analysis of the covid-19 impacts on the using Chicago urban shared bicycles by tensor-based approach
Cycling is a phenomenon in urban transportation that has the ability to allocate a specific location at any moment in time. Accordingly, spatial analysis of bicycle trips can be accompanied by temporal analysis. The use of a GIS environment is commonly recommended to display the extent of the phenomenon's spatial changes. However, in order to apply and display changes over time, it will requir...
متن کاملWhat Do I See? Modeling Human Visual Perception for Multi-person Tracking
This paper presents a novel approach for multi-person tracking utilizing a model motivated by the human vision system. The model predicts human motion based on modeling of perceived information. An attention map is designed to mimic human reasoning that integrates both spatial and temporal information. The spatial component addresses human attention allocation to different areas in a scene and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Int. J. Semantic Computing
دوره 7 شماره
صفحات -
تاریخ انتشار 2013